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Abstract — A single-user multiple-input multiple-output com- 
munication scheme is considered where a transmitter, equipped 
with multiple antennas, encodes the data into independent 
streams via a single linear code. The coded streams are then 
precoded using linear space-time modulation from the class of 
perfect codes. At the receiver side, integer-forcing equalization is 
applied, followed by standard single-stream decoding. It is shown 
that this communication architecture achieves the capacity of any 
multiple-input multiple-output channel up to a constant gap. 



I. Introduction 

The Gaussian Multiple-Input Multiple-Output (MIMO) 
channel has been the focus of extensive research efforts since 
the pioneering works of Foschini [1], Foschini and Gans 0, 
and Telatar J3). While the capacity of the channel, under 
various assumptions on channel state information, is easy to 
derive, practical schemes that allow to approach the capacity 
are only known for the two extremes: when the channel is 
either ergodic, with or without channel state information at 
the transmitter, or static where the transmitter knows the 
channel matrix. In contrast, this paper considers a static 
(non-ergodic) scenario where the receiver has full channel 
knowledge whereas the transmitter either has no knowledge 
of the channel, or alternatively knows only its capacity. 

More specifically, a single-user complex MIMO channel 



Hx 



(1) 



with M transmit and N receive antennas is considered. The 
input vector x is subject to the power constrain^] 

E(x f x) <MSNR, 

and the additive noise z ~ CAf(0, 1) is a circularly symmetric 
complex normal vector with zero mean and unit variance. 

The mutual information of this channel is maximized using 
a circularly symmetric complex Gaussian input with zero mean 
and covariance matrix Q satisfying tracc(Q) < M-SNR, and 
is given by J3j 



C = 



Q^O 



max log dot (I 

trace Q<M SNR v 



QI^H) 



(2) 



The choice of Q that maximizes (O is determined by the 
water-filling solution. When the matrix H is known at both 
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transmission ends, i.e., in a closed-loop scenario, this mutual 
information is the capacity of the channel and may closely 
be approached using the singular-value decomposition in con- 
junction with standard scalar codes designed for an additive 
white Gaussian noise (AWGN) channel. Often, the sub-optimal 
choice Q = SNR • I is used, resulting in the white-input (WI) 
mutual information 



logdet (l + SNRH f H) 



For all channel matrices H and all values of SNR, the 
WI mutual information loses less than MlogM bits w.r.t. 
capacity |4], Q. Thus, a scheme that performs a constant gap 
from Cwi also performs a constant gap from the closed-loop 
capacity C. In this paper, an open-loop setting is considered. 
In this case, transmitting a white input is a natural choice and 
Cwi will serve as a benchmark in the sequel. 

While the theoretical performance limits of open-loop com- 
munication over a Gaussian MIMO channel are well un- 
derstood, unlike for closed-loop transmission, much is still 
lacking when it comes to practical schemes that are able 
to approach these limits. Such schemes are known for the 
1x2 MISO channel where Alamouti modulation offers an 
optimal solution. More generally, modulation via orthogonal 
space-time block "codes" allows to approach the WI mutual 
information using scalar AWGN coding and decoding in the 
limit of small rate, where the mutual information is governed 
solely through the Frobenius norm of the channel matrix. 

Beyond the low rate regime, i.e., when the multiple de- 
grees of freedom offered by the channel need to be utilized, 
despite considerable work and progress, much remains to 
be hoped for. Since finding a practical scheme that is able 
to approach the WI mutual information with no transmitter 
channel knowledge turned out to be a very challenging task, 
less demanding benchmarks became accepted in the literature. 
First, since statistical modeling of a wireless communication 
link is often available, one may be content with guaranteeing 
good performance only for channel realizations that have a 
"high" probability. Moreover, to further simplify analysis and 
design, the asymptotic criterion of the diversity-multiplexing 
tradeoff [4 1 has widely been adopted. Unfortunately, such char- 
acterizations offer only a coarse figure of merit for assessing 
schemes. 

Realizing that a metric based on a particular statistical 
model is prone to be non-robust to these very assumption, 
a major achievement was the definition of approximately 
universal space-time codes by Tavildar and Vishwanath ||6] 
and in identifying and developing new schemes that satisfy this 
property [7|-[9|. Roughly speaking, approximate-universality 
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guarantees that a scheme is DMT optimal for any statistical 
channel model. 

Approximately universal schemes still suffer from the 
asymptotic nature of the DMT criterion. In particular, since 
uncoded transmission suffices to achieve the optimal DMT, 
knowing that a scheme is DMT optimal does not provide 
any performance guarantee for a channel with fixed (and even 
known) capacity. 

In this work, a practical communication architecture that 
achieves the capacity of any MIMO channel up to a constant 
gap is studied. Specifically, in the considered scheme, the 
transmitter encodes the data into independent streams via 
a single linear code. The coded streams are then linearly 
precoded using the generating matrix of a space-time code 
from the class of perfect codes ll8l- lfTTI . At the receiver side, 
integer-forcing equalization [12 | is applied. 

Integer-forcing equalization essentially reduces to lattice- 
reduction (LR) in the case of uncoded transmission. Lattice- 
reduction aided receivers for perfect space-time modulated 
QAM constellations were considered in the literature, and were 
shown to be DMT optimal 1 1 3 1 . The key difference is that 
while the latter approach involves uncoded transmission and 
symbol-by-symbol detection, the proposed architecture uses 
linearly coded streams and the detection phase is replaced with 
equalization. This is turn leads to performance guarantees that 
are valid at any (fixed) transmission rate. 

II. Preliminaries on Integer-Forcing Equalization 

Integer-Forcing equalization is a low-complexity architec- 
ture for the MIMO channel, which was proposed by Zhan 
et al. Ifl2ll . The key idea underlying IF is to first decode 
linear combinations of the signals transmitted by all antennas, 
and then, after the noise is removed, invert those linear com- 
binations to recover the individual transmitted signals. This 
is made possible by transmitting codewords from the same 
linear/lattice code from all M transmit antennas, leveraging 
the property that linear codes are closed under (modulo) linear 
combinations with integer- valued coefficients. 

A. Nested Lattice Codes 

Let A c C Af be a pair of n-dimensional nested lattices 
(see lfl4ll for a rigorous treatment of lattice definitions and 
properties). The lattice A c is referred to as the coarse lattice 
and Af as the fine lattice. Denote by V c the Voronoi region 
of A c , and define the second moment of Ac as 



lurdu. 



nVol(V c ) JueVc 
A nested lattice codebook C = Af n V c , with ratdl 



R = -log|A/nV, 



bits 



channel use 



is constructed from the nested lattice pair. The codebook is 
scaled such that a 2 (A c ) = SNR/2. 



Example 1: Three examples of common nested lattice 
codebooks are given below. More examples can be found 

m ma. 

• Uncoded transmission - The simplest nested lattice code- 
book is an uncoded one, where the fine lattice A f is the 
integer lattice Z whereas the coarse lattice is A c = qL 
for some integer q > 1. The Voronoi region in this case 
is V c = [—q/2, q/2) and the obtained nested lattice code- 
book C consists of all integers in the interval [— q/2, q/2). 
The rate of this codebook is R = \ogq bits/channel use. 

• q-ary linear code without shaping - A more sophisti- 
cated, yet reasonable to implement, nested lattice code- 
book can be obtained by lifting a q-ary linear code with 
block length n to Euclidean space using Construction 
A 11161 , [17 1, and taking the resulting lattice as Af. 
The coarse lattice is taken as A c = qZ™, as in the 
uncoded case. The obtained nested lattice codebook C 
is therefore simply the q-ary linear code coupled with a 
PAM constellation. 

• "Good" nested lattice pair of high dimension - A third 
option is to use a pair of lattices of high dimension where 
the fine lattice is "good" for coding over an AWGN 
channel, whereas the coarse lattice is "good" for mean 
squared error quantization (see [ 14] for precise definitions 
of "goodness"). The obtained nested lattice codebook ad- 
mits a relatively simple performance analysis, that yields 
closed-form rate expressions. However, implementing 
such a codebook is more complicated (although some 
progress in this direction was made in [18]). 

The performance improvement obtained by using such 
a codebook w.r.t. a q-ary linear code without shaping 
is bounded by 1 /2 log(27re/12) bits per real dimension, 
provided that the q-ary linear code performs well over an 
AWGN channel. 

B. Description of IF equalization 

In the IF scheme, the information bits to be transmitted are 
partitioned into 2M streams, labeled {Isr, Is, . . . , Msr, Mg}. 
Each of the 2M streams is encoded by the nested lattice 
code C. In particular, the stream mj{, consisting of nR 
information bits, is mapped to a lattice point t m , R G C. Then, a 
random dither d miJf G K lx ™ uniformly distributed over V c and 
statistically independent of t mm , known to both the transmitter 
and the receiver, is used to produce the signal 



[t, 



v .<" h- i. ''//.' ^ mod A c 



The signal x msR is uniformly distributed over V c and is 
statistically independent of t miR due to the Crypto Lemma [ 14 
Lemma 1]. It follows that 

-E||x m J| 2 - ( T 2 (A c ) = ^-. 
n I 

A similar procedure is used to construct the signal x„ l3 . The 

mth antenna transmits the signal x m = x miR +ix ma G C lx " 

over n consecutive channel uses. 



Let X = fxf • • • x 



T IT c irMxn 



Ml 



G 



The received signal is 



2 All logarithms in this paper are to the base 2. 



Y = HX + Z, 
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where Z G C Nxn is a vector with i.i.d. circularly symmetric 
complex normal entries. Letting the subscripts and 3 denote 
the real and imaginary parts of a matrix, respectively, the 
channel can be expressed by its real-valued representation 



for nota- 

T ]T 







HsR 


Ho " 










Y 3 




Hcj 


HsR 
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Zq 



Y = 



HX 



which will be written as 

tional compactness. Let T = \t{ • • • tj^ ■ ■ ■ t M 

be a 2M x n matrix whose rows consist of the lat- 
tice points representing each of the 2M bit streams, and 
D = [df • • ■ d^ df ■ • • d^J T be a 2M x n matrix 
whose rows correspond to the 2M different dither vectors. 

The IF receiver chooses an equalizing matrix B 6 ^Mx2N 
and a full-rank target integer- valued matrix A £ Z , and 

computes 



Ypff = 



BY + AD 



mod A r 



= AT + (BH - A)X + BZ mod A c 

= [V + Z eff ] mod Ac, (3) 

where V = AT mod A c is a 2M x n matrix with each 
row being a codeword in C owing to the linearity of the 
code, Z eff = (BH - A)X + BZ is additive noise statistically 
independent of V (as X, as well as Z are statistically indepen- 
dent of T), and the notation mod A c is to be understood as 
reducing each row of the obtained matrix modulo the coarse 
lattice. Thus, each row of Y e ff is the output of a modulo 
additive noise channel whose input is a codeword from C 

YeS,k = [vfe + z eff) fc] mod A c , k = 1, . . . , 2M. (4) 

It is evident that the IF receiver transforms the original 
MIMO channel into a set of 2M parallel point-to-point sub- 
channels. The output of each sub-channel can be decoded 
separately. If the decoding is successful over all 2M sub- 
channels, the receiver has access to V, from which it can 
recover the matrix T by solving the (modulo) se0 of equations 
V = AT mod A c . See Figures Q] and |2] 
Define the effective variance of z e ff & as 



2 

"eff,k 



-E ||z effifc || 2 
n 

1 ' (b fc H-a fe )X- 



= — E 
n 
SNR 



ll(bfcH-afc) 



b fc Z 



i||b fc | 



where and a^. are the fcth rows of B and A, respectively. 
A natural criterion for choosing the equalizing matrix B and 
the target integer-valued matrix A is to minimize the effective 

3 The additive noise vectors z e ff i, . . . , z e ff,2M are not statistically indepen- 
dent. Thus, strictly speaking, the 2M effective channels y e tt,l, ■ ■ ■ , Ycff,2M 
are not parallel. However, the IF decoder ignores the correlation between the 
noise vectors and treats the 2Af obtained channels as being parallel. Some 
improvement can be obtained by exploiting these correlations 1191 . Yet, we 
disregard this possibility in the present paper. 

4 In 1 20 1 it is shown that it suffices that A is invertible over M in order to 
recover T from V. 



noise variances. It turns out 0121 that for a given matrix A, 
the optimal choice of B under this criterion is 

- f 1 - ~ N _1 

B °pt _ ah t I __! + hh t 

which results in the effective variances 
SNR 



'eB,k ~ 2 k 



at I + SNRH J H a fe 



(5) 



for k = 1,...,2M. 

Define the effective signal-to-noise ratio (SNR) at the kth 
sub-channel as 

SNR e f t ;fc = g— — 



SNR 

2 



s^ani + SNRH^H) a fc 



= I aM I + SNRH H ) a k 



and let 



SNR ef f^ min SNR eff .fc. 

fc=l,...,2M 



(6) 



For IF equalization to be successful, decoding over all 2M 
sub-channels should be correct. Therefore, the worst sub- 
channel constitutes a bottleneck. For this reason, the total 
performance of the equalizer is dictated by SNR e ff. 

C. Performance of the IF Equalizer 

When the codebook C is constructed from a good pair of 
nested lattices (see Example Q]) the distribution of the effective 
noise at each sub-channel k, which is a linear combination 
of an AWGN and 2AI dither vectors, approaches that of an 
AWGN with zero mean and variance k [21]. Good nested 
lattice codebooks can achieve any rate satisfying 



R< ilog(SNReH, fc ) 



(7) 



over an AWGN modulo channel with signal-to-noise ratio 
SNR e ff fc 03), ||2TI . Since is a codeword from a good nested 
lattice code and z e ff,fc approaches an AWGN in distribution, 
Vfe can be decoded lfT2l . ||2T|| from y e ff,fc as l° n g as the rate 
of the codebook C satisfies (0. It follows that as long as 

R< \ log (SNR ef f), 

all sub-channels k = 1,...,2M can decode their linear 
combinations v& without error, and therefore IF equalization 
can achieve any rate satisfying 

.1 



i? IF <2M-log (SNReff) 
= M log (SNReff). 



(8) 



As mentioned in Example Q] good nested lattice codebooks 
may be difficult to implement in practice. A more appealing 
alternative is therefore to use a q-ary linear code without 
shaping. In this case, the effective noise z e ^ j. at each sub- 
channel is a linear combination of an AWGN and 2 M random 
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Fig. 1. A schematic overview of the integer-forcing transmitter and receiver. For simplicity, the dithers are not depicted in the figure, and a real- valued 
channel is assumed. At the transmitter, the information bits are split to M streams. Each stream is encoded by the same similar linear codebook and transmitted 
by one of the transmit antennas. The receiver first applies the equalizing matrix B which is supposed to equalize the channel H to an equivalent channel 
with transfer matrix approximately equal to A. The equalizer produces M outputs, each of which is an integer-valued linear combination of the transmitted 
codewords plus effective noise. Each one of these outputs is decoded separately, and finally the outputs of the M decoders are multiplied by A" 1 to produce 
the transmitted codewords. The codewords are then mapped to information bits (this step is not depicted in the figure). 
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with uncoded transmission is 



Vk = [vk + Ze&,k] mod -fqZ 
= ["fu k + Zeff.fc] mod jqZ, 
1 



7 \Uk + -Zeft,k 

7 ' 



Uk 



-Zeff.k 



mod 7<?Z, 
mod qL 



Fig. 2. An illustration of the effective channel obtained when integer- 
forcing equalization is used. The effective channel consists of M parallel 
sub-channels. The output of each sub-channel is an integer-valued linear 
combination of lattice points, which is itself a lattice point, plus effective 
noise, modulo the coarse lattice A c . 



where Uk = Vfc/7 G Z. Thus, a detection error occurs only if 



-Zeff.k 



> 



dithers uniformly distributed over the Voronoi region of a 1-D 
integer lattice. This effective noise is i.i.d. (in contrast to the 
case where a higher-dimensional coarse lattice is used where 
z e ff,fc has memory). It was shown in [22 1 that, for a prime q 
large enough, q-ary linear codes without shaping can achieve 
any rate satisfying 

1 1 ( 2ire \ 

i?<-log(SNR eff )--log(^— J 

over a modulo channel with additive effective noise z e ff,fc. 
Therefore, IF equalization using q-axy linear codes without 
shaping can achieve any rate below 

i?IF,q-ary < M log (SNR eff ) - M log f^j (9) 

When a specific q-ary linear code (such as an LDPC code or 
a turbo code) is used, the achievable rate is further degraded 
by 2M times the code's gap-to-capacity at the target error 
probability. 

Finally, consider the case of uncoded transmission. In this 
case, A f = 7 Z and A c = 79Z, where 7 = ^12SNR/(2q 2 ) is 
chosen such as to meet the power constraint, and q > 1 is an 
integer (see Example [T). The output of the fcth sub-channel 



In order to bound the probability of detection error at the fcth 
sub-channel, a simple lemma, which is based on [ 15, Theorem 
7] is needed. 

Lemma 1: Consider the random variable 

L K 

Zeff = atzt + Pkdk 
l=\ fc=l 

where {zi}^ =1 are i.i.d. Gaussian random variables with zero 
mean and some variance a\ and {dk}^ =1 are i.i.d. random 
variables uniformly distributed over the interval [—p/2, p/2) 
for some p > 0. The random variables {ze}f_ 1 are statistically 
independent of {<4}fL r Let a^ff — ^( z eff)- Then 



Pr(z e ff > t) = Pr(z e ff < -r) < exp 




Proof: See Appendix lAl ■ 
Now, using Lemma [TJ the probability of detection error at 



5 



the fcth sub-channel can be bounded as 



the matrix is symmetric and positive definite) 



P E: k = Pi' (h ^ Vk 
1 



< Pr 



< 2 exp ■ 



< 2 exp ■ 



< 2 exp 



-Z e ffh 



> 



8<7 eff,fc j 

3SNR 

4g 2 cr 2 



eff.fc J 
-^SNR eff , fc 



where the definition of SNReff^ was used in the last inequality. 
Using the fact that q = 2 R and that SNR e ff,fc > SNR e ff for all 
k = 1, . . . , 2 M, the detection error probability at each of the 
2M sub-channels can be further bounded as 



P e < 2 exp 



' 2 2(i log(SNR eff ) 



Since the IF equalizer makes an error only if a detection 
error occurred in at least one of the 2M sub-channels, and 
since the total transmission rate is i? IF = 2MR, the total error 
probability of the IF equalizer with uncoded transmission rate 
Rip is bounded by 



P, 



e.IF-uncoded 



<4Mexp j~2 2 (^°s( SNR -)-w)j 
= 4Mexp |_^ 2 ^( M1 °s( SNR -)-^)) . (10) 



Remark 1: Integer forcing equalization with uncoded trans- 
mission is quite similar to the extensively studied lattice- 
reduction-aided linear decoders framework [fPTl , ll23l , lE4l - 
However, two subtle differences should be pointed out. First, 
under the framework of LR-aided linear decoding, the target 
integer matrix A has to be unimodular, i.e., it has to satisfy 
| det(A)| = 1, whereas in IF equalization A is only required 
to be full-rank. Second, the use of the dithers in IF equalization 
results in statistical independence between Vk and z e ff,fc at each 
of the 2M sub-channels. This allows for an exact rigorous 
analysis of the error probability, which is seemingly difficult 
under the LR framework. 



D. Bounding the Effective SNR for an optimal choice of A 

In this subsection, parts of the derivation from [12. Proof 
of Theorem 4] are followed in order to obtain a lower bound 
on SNReff. 

The target integer-valued matrix A should be chosen such 
as to maximize SNR e ff. This criterion translates to choosing 

A opt = argmax SNR eff 

AEZ 2Mx2M ,dct(A)/0 



argmm 



max &T I + SNRH T H 



.2M 



at- 



AeZ 2M x 2J w,dct(A)#0 fe=l. 

Consider the Cholesky decomposition of the matrix 
(l + SNRH T H) (such a decomposition exists since 



I + SNRH T H 



LL 



(11) 



where L is a lower triangular matrix with strictly positive 
diagonal entries. With this notation the optimization criterion 
becomes 

A opt = argmin max ||L T afc|| 2 . 

AeZ 2Afx2M ,d ct (A)/0 fcL - 2M 

Denote by A(L T ) the 2M dimensional lattice spanned by the 
matrix L T , i.e., 

A(L T ) = |L T a : a e Z 2M } . 

It follows that A opt should consist of the set of 2M linearly 
independent integer-valued vectors that result in the shortest 
set of linearly independent lattice vectors in A(L T ). 

Definition 1 (Successive minima): Let A(L T ) be a lattice 



spanned by the full-rank matrix L e 



p2Mx2Af 



For 



k = 1, 



. , 2M, we define the fcth successive minimum as 



A fc (L T )^inf|r : dim (span (A(L T ) p| B{0, r))) > fcj 

where 23(0, r) = {x e R 2M : ||x|| < r} is the closed ball of 
radius r around 0. In words, the fcth successive minimum of 
a lattice is the minimal radius of a ball centered around that 
contains k linearly independent lattice points. 

With the above definition of successive minima, the effective 
signal-to-noise ratio, when the optimal integer-valued matrix 
A opt is used, can be written as 



SNR eff = 



1 



^a/(L t )- 



(12) 



Bounding the value of the 2Mth successive minima of a 
lattice is seemingly difficult. Fortunately, the theory of lattices 
connects the 2Mth successive minima of a lattice to the first 
successive minima of its dual lattice. 

Definition 2 (Dual lattice): For a lattice A(G) with a gen- 
erating full-rank matrix G 6 R 2Mx2M the dual lattice is 
defined by 

A*(G) = A ((G T ) _1 ) 

= {(G T )- 1 a : aeZ 2M }. 



Lemma 2 ( j |25] Proposition 3.3]): Let A(G) be a lattice 
with a full-rank generating matrix G S R 2Mx2M and let 
A*(G) = A ((G T ) _1 ) be its dual lattice. The successive 
minima of A(G) and A*(G) satisfy the following inequality 



Xj (G) \\ ((G T ) _1 ) < (2M) 3 , W = 1,2,. 



Proof: See l25l 



,2M. 
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The dual lattice of A(L T ) is A(L 1 ). Applying Lemma [2] 
to (fT2] i gives 

SNR eff : ' 



> 



a!m(l t ) 

(2M)^ A ' (L_1: 
1 



IL" 1 ! 



8M 3 aez 2M \o 

——r min a T (LL T ) _1 a 

' min a T (l + SNRH T H) a 



(13) 



8M 3 aez 2M \o 

where ([T3l follows from (ITTb . 

The performance of IF equalization over Rayleigh fading 
channels was studied in 1121 . Using bounding techniques 
similar to those used in this subsection, it was shown that the 
IF equalizer achieves the optimal receive DMT (corresponding 
to transmission of independent streams from each antenna) 
when N > M. Nevertheless, there are clearly instances of 
MIMO channels for which the lower bound ( fT3l on SNR e ff 
does not increase with the WI mutual information. For ex- 
ample, consider a channel H where one of the NM entries 
equals h whereas all other gains are zero. For such a channel 
Cwi = log(l + |/i| 2 SNR), yet SNR eff = 1 (and the bound (O 
only gives SNR eff > 1/(8A/ 3 )). Thus, it is evident that IF 
equalization alone can perform arbitrarily far from Cwi- 

As shown in the sequel, this problem can be overcome by 
transmitting linear combinations of multiple streams from each 
antenna. More precisely, instead of transmitting 2M linearly 
coded streams, one from the in-phase component and one from 
the quadrature component of each antenna, over n channel 
uses, 2MT linearly coded streams are precoded by a unitary 
matrix and transmitted over nT channel uses. 

E. Linearly Precoded Integer-Forcing Equalization 

Very recently Domanovitz et al. [26 1 proposed to com- 
bine IF equalization with linear precoding. The idea is to 
transform the N X M complex MIMO channel l|T|) into an 
aggregate NT x MT complex MIMO channel and then apply 
IF equalization to the aggregate channel. The transformation 
is done using a unitary precoding matrix P € ^a/txa/t 
Specifically, let x 6 C MTxl be the input vector to the 
aggregate channel. This vector is multiplied by P to form 
the vector x = Px € c*/Txi whi^ j s transmitted over the 
channel (Q~|) during T consecutive channel uses. Let 

H ••• 
H 



H = h 



H 



(14) 



••■ H 

where ® denotes the Kronecker product. The output of the 
aggregate channel is obtained by stacking T consecutive 
outputs of the channel (Q~|) one below the other and is given 
by 

y = HPx + z 

= Hx + z, (15) 



where H = HP = (It <£> H)P G C WTxMT is the aggregate 
channel matrix, and z G <C NTxl is a vector of i.i.d. circularly 
symmetric complex normal noise. See Figure [3] 

A remaining major challenge is how to choose the precoding 
matrix P (recall that an open-loop scenario is considered, and 
hence the choice of P cannot depend on H). As observed in 
Section III-C1 the performance of the IF equalizer is dictated 
by SNR e ff. Thus, in order to obtain achievable rates that are 
comparable to the WI mutual information, SNR e ff must scale 
appropriately with Cwi- The precoding matrix P should be 
chosen such as to guarantee this property for all channel 
matrices with the same WI mutual information. 

It turns out that the well-developed theory of linear disper- 
sion space-time codes provides excellent candidates for P that 
satisfy the aforementioned criterion. In [26 1 the performance 
of IF equalization with the golden code's ifTTIl precoding 
matrix was numerically evaluated in a 2 x 2 MIMO Rayleigh 
fading environment. The scheme's outage probability was 
found to be relatively close to that achieved by white i.i.d. 
Gaussian codebooks. Here, we prove that, in fact, precoded IF 
equalization, where the precoding matrix generates a perfect 
linear dispersion space-time code, achieves rates within a 
constant gap from the WI mutual information of any MIMO 
channel. 

The next section provides some necessary background on 
space-time codes, needed for the proof of our main result. 

III. Linear Dispersion Space-Time Codes 

An M x T space-time (ST) code C ST for the channel (QJ with 
rate R is a set of |C ST | = 2 RT complex matrices of dimensions 
M x T, The codebook C ST has to satisfy the average power 
constrain^ 

1 



2-RT E 

x fc ec ST 



|X fc ||f. < MT ■ SNR. 



When the ST code C ST is used, a code matrix X G C ST is 
transmitted column by column over T consecutive channel 
uses, such that the T channel outputs can be expressed as 

Y = HX + Z, 

where each column of the matrices Y, Z G C NxT represents 
the channel output and additive noise, respectively, at one of 
the T channel uses. 

An ST code C ST is said to be a linear dispersion ST 
code ll27l over the constellation S if every code matrix 
X G C ST can be uniquely decomposed as 



X 



K 

E 

k=l 



SfeFfe, s k G <S, 



where S is some constellation and the matrices Ffc G C MxT 
are fixed and independent of the constellation symbols s^. De- 
noting by vec(X) the vector obtained by stacking the columns 
of X one below the other, and letting s = [si ■ ■ ■ sk] T gives 

vec(X) = Ps, 

5 The Forbenius norm of a matrix X is denoted by ||X|| p. 
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Fig. 3. A schematic overview of linearly precoded integer- 
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brcing. For simplicity, the dithers are not depicted in the figure, and a real-valued channel is 



assumed. At the transmitter, the information bits are split to TM streams, each of which is encoded by a similar linear code. Then, a TM X TM precoding 
matrix "mixes" the TM codewords into TM linear combinations. The channel H is used T times, where in each channel use each of the antennas transmits 
one of the precoded linear combinations. The receiver treats T consecutive channel outputs as the output of an aggregate AT X MT channel with transfer 
matrix H = (Iy H)P, and applies integer-forcing equalization to the aggregate channel. 



where 



[vec(Fi) vec(F 2 ) ••• vec(F K )] 



is the code's MT x K generating matrix. A linear dispersion 
ST code is full-rate if K = MT. In the sequel, linear 
dispersion codes over an 1? — QAM constellation will play a 
key role. Here, an 1? — QAM constellation is defined as 

{—L, —L + l,...,L — l,L} + i {—L, —L + l,...,L — l,L} 

for some positive integer L. The linear dispersion ST code 
obtained by using the infinite constellation iSqo = Z + iL 
is referred to as CjJ, and, after vectorization, is in fact a 
complex lattice with generating matrix P. Since the L 2 — QAM 
constellation is a subset of Z + iL it follows that for any finite 
L the L 2 - QAM based code C ST is a subset of C^. 

An important class of linear dispersion ST codes is that of 
perfect codes [8|, [9| which is defined next. 

Definition 3: An M x M linear dispersion ST code over a 
QAM constellation is called perfect if 

1) It is full-rate; 

2) It satisfies the nonvanishing determinant criterion 



^min (^i 



ST\ 



mm 



det(X)| 2 > 0; 



3) The code's generating matrix is unitary, i.e., Ptp = I. 

Note that this definition is slightly different than the one 
used in (8), (9), where instead of condition [3] it is required that 
the energy of the codeword corresponding to the information 
symbols s will have the same energy as ||s|| 2 , and that all the 
coded symbols in all T time slots will have the same average 
energy. 



In 00, perfect linear dispersion ST codes were found for 
M = 2,3,4 and 6, whereas in [9] perfect linear dispersion 
ST codes were obtained for any positive integer M. The 
constructions in J8), |[9) are based on cyclic division algebras, 
and results in a unitary generating matrix P. Thus, for any 
positive integer M, there exist codes that are perfect according 
to Definition [3] 

The approximate universality of an ST code over the MIMO 
channel was studied in (6). This property refers to an ST 
code being optimal in terms of DMT regardless of the fading 
statistics of H. A sufficient and necessary condition for an ST 
code to be approximately universal was derived in [6. Theorem 
3.1]. This condition is closely related to the nonvanishing 
determinant criterion. The next Theorem is a simple extension 
of (6] Theorem 3.1]. The notation [x} + = max(a;,0) is used. 

Theorem 1: Let C ST be an M x M perfect linear dis- 
persion ST code over an 1? — QAM constellation with 
^min(C^) = mirio^xec^ | det(X)| 2 > 0. Then, for all chan- 
nel matrices H with corresponding WI mutual information 
Cwi = logdet(I + SNRHtH) and all ^ X e C ST 



SNR||HX|| 2 P > 



2M 2 L 2 



The proof closely follows that of [6 Theorem 3.1]. 
Proof: Consider some arbitrary O^Xe C ST and let 

H = Ui*Vj and X = U 2 AV^ 

be the singular value decompositions (SVD) of H and X, 
respectively. With this notation 



SNR||HX||| = SNR||*VjU 2 A|||. 



(16) 



s 



Suppose the (absolute) singular values are ordered by increas- 
ing value in A and by decreasing value in 

A = diag{Ai,...,A A f}, 

* = diag{-0i,---,Vw°>-" >°h 

where m„ = min{M, N}. In order to establish the desired 
result one has to find the channel H with corresponding WI 
mutual information Cwi that minimizes (TToT l. The rotation 
matrix Vi that minimizes ( TToT l is Vi = U2 which aligns the 
weaker singular values of the channel matrix with the stronger 
singular values of the code matrix ll28l . Thus, the problem of 
finding the worst channel matrix H w.r.t. the codeword X 
reduces to the optimization problem 



for all TH = 1,...,M. Without loss of generality we may 
assume that 2M 2 L 2 < 5 min (C^)' n '2~ as otherwise the the- 
orem is trivial. Since by definition 

|A 1 ---A M | 2 = |dct(X)| 2 >5 mm (0, 

we have for all m = 1, . . . , M 

I A, 



I 2 < 2M 2 L 2 



< |Ai • • • Am I M 2 M ■ 



Thus, the solution for < f20b is 



min SNRV |^ m | 2 |A r( 



i/>i,...,i/> 



m= 1 



. . 2 Cy/l 

|Ai---A M | M 2 ** . 



(23) 



(24) 



subject to ^log(l + foM 2 SNR) = Cwi- (17) Substituting (fJH into g5J gives 



m— 1 



A lower bound on the solution of the minimization prob- 
lem ( TT7| > can be obtained by replacing m n with M > m n , 
which increases the optimization space and results in 



M 



min SNR V \ip m \ 2 \K 

lj)l,...,lpM — ' 

m— 1 



M 



subject to !og(l + |^™| 2 SNR) = C m . (18) 



m— 1 



The solution to (fT8l is given by standard water-filling J6] 



A/ 



SNR||HX||£> 



where A satisfies 



M 

E 



log 



A I A, 



v — |A m | 2 



(19) 



(20) 



Recall that X is a codeword from a perfect linear dispersion 
ST code over an L 2 — QAM constellation. Let P be the 
generating matrix of the code C ST . Thus, vec(X) = Ps 
for some vector s whose M 2 components all belong to the 
L 2 — QAM constellation. This implies that 



m— 1 



\ 2 = \\m 

= II vec(X)|| 2 
= HPsf 

= NI 2 

< 2M 2 L 2 , 



(21) 
(22) 



where (fJTJ follows from the fact that P is unitary. In partic- 
ular, (l22l implies that 

|A m | 2 < 2M 2 L 2 



SNRIIHXII 



> 

> 
> 



M 

E 

m— 1 



M|Ai • ■ • A M | lt3 '2— 

M6 min (C S J)^2^-2M 2 L 2 ] 
S mm (C^2^ 



|A m | : 



2M 2 L 2 



as desired. ■ 

Let H = Im ® H, as in ( TT~4-b . The next simple corollary of 
Theorem [T] will be used in Section|lV]to prove the main result 
of this paper. 

Corollary 1: Let P be the M 2 x M 2 generating matrix of 
a perfect linear dispersion code C ST over a QAM constellation 
with <5 min (C^) = mirio^xecg |det(X)| 2 > 0. Then, for all 
channel matrices H with corresponding white input capac- 
ity Cwi = logdet(I + SNRlPH) and all complex vectors 
a G Z M + tZ M with all real and imaginary entries not 
greater than L 



SNR||^Pa|| 2 > 



S mm (C s ^2^ 



2M 2 L 2 



Proof: Assume the constellation over which the code 
C ST is defined is an L 2 — QAM one. Then, for any 
a G Z M + i1 M with all real and imaginary entries not 
greater than L there exist a code matrix X G C ST such that 



vec(X) 



Pa. 



Now, 



SNR||WPa|| 2 = SNR||ft vec(X) 
= SNR||HX|| 2 P 



> 



S mm (C s ^2^ L -2M 2 L 2 
where the last inequality follows from Theorem Q] 
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IV. Main Result 

The next theorem lower bounds the effective signal-to-noise 
ratio of linearly precoded IF equalization, where the precoding 
matrix generates a perfect linear dispersion ST code. The 
obtained bound depends on the channel matrix H only through 
its corresponding WI mutual information. 

Theorem 2: Consider the aggregate MIMO channel 

y = "HPx + z 

where U = I M ® H e C*""*^, and P e C m2 * m * is 
a generating matrix of a perfect M x M QAM based ST 
code C ST with <5 mi „(0 = min o#xecS I det(X)| 2 > 0. Then, 
when IF equalization is applied to the aggregate channel 



SNReff > 



1 



1_ Cwi 



16M 8 

for all channel matrices H with corresponding WI mutual 
information Cm = logdet(I + SNRIPH). 

Proof: Let H = HP and consider its real-valued repre- 
sentation 

Hq Hsr 

It follows from (|T3T > that the effective-signal-to-noise ratio 
when IF equalization is applied to the aggregate channel is 
upper bounded by 
1 



H 



SNR eff > 



8A/ 6 



a T (I 



aGZ 2M \0 



SNRH H a 



(25) 



• 2M'x2Ar 



Since the matrix (l + SNRH T HJ e K""* is 
the real-valued representation of the complex matrix 
(I + SNRH T H) e C m2 * m \ (E3 can be written in complex 
form as 

SNR ef f >— — - min a T 

8M b a ez M2 +iZ M2 \o 



1 



8M 6 aeZ M2 +iZ M2 \0 

1 



8M 6 



SNRH T H) a 

-SNR||Ha|| 2 ) 
-SNRII-HPall 2 ) 



& ei, M +a M \o 



(26) 



For a vector a' € Z M + iL M \ assume that the maximum 
absolute value of all its real and complex entries is L, i.e., 

L = max [ja'^j a^|] . 

The norm ||a'|| 2 is trivially bounded from below by L 2 and 
by Corollary Q] 

SNR||HPa'|| 2 > L»(CL t )w2t — 2M 2 L 2 
Therefore, (j26j can be further bounded by 



SNReff > 



> 



1 



8M 6 L=l,2 

1 



min I L 2 



1 g WI 



«min(C)*2^ -2M 2 L 2 



> 



8M 6 L=l,2 

1 



min L 



2M 2 



5 min (Cf T )*2T 



16M 8 



as desired. ■ 

The next theorem is the main result of this paper. 

Theorem 3: For all channel matrices H with M transmit 
antennas and an arbitrary number of receive antennas, the 
achievable rate of linearly precoded IF equalization is a 
constant gap from the closed-loop capacity of the channel, 
provided that the precoding matrix P generates an M x M 
perfect linear dispersion space-time code. 

Proof: Let C W i = logdet(I + SNRHTH) be the WI mu- 
tual information corresponding to H, and let P be a gen- 
erating matrix of a perfect M x M QAM based code C ST 
with 5 m ; n (C^) = min ^xeC ST I det(X)| 2 > 0. Such a matrix 
P exists for any positive integer M J9). The matrix P is used 
as a precoding matrix that transforms the original N x M 
MIMO channel (B to the aggregate NM x M 2 MIMO channel 



y = HPx + z 
= Hx + z, 



(27) 



as described in Section llLEl and then IF equalization is applied 
to the aggregate channel. Assuming a "good" nested lattice 
codebook is used to encode all 2M 2 streams transmitted over 
the aggregate channel, by ®, IF equalization can achieve any 
rate satisfying 

-^IF.aggregate 

< M 2 log(SNReff). 
Using Theorem |2] it follows that any rate satisfying 

1 



-RlF.aggregate < M log 



16M 8 

MC W i - M\og 



1 



^min (C, 



ST\ 

oc / 



- M log(16M 8 ) 
(28) 



is achievable over the aggregate channel. 

Since each channel use of the aggregate channel (|27i > 
corresponds to M channel uses of the original channel ((T), 
the communication rate should be normalized by a factor of 
1/M. Thus, linearly precoded IF equalization can achieve any 
rate satisfying 



WI 



log 



1 



<-'imn 



Mlog(16M 8 ) 



Cm ~ T (<5min(Coo)) M 



where 



r(<5 min O,M) 4 log 



1 



Mlog(16M 8 ) (29) 



is a constant independent of the channel H and SNR. Since 
the gap between Cwi and the closed-loop capacity is bounded 
by a constant number of bits, the theorem is established. ■ 

Note that although the proof of Theorem [3] assumed that 
a "good" nested lattice code was used, a similar result holds 
when a g-ary linear code without shaping is used. This follows 
from the fact that the latter code loses only the shaping loss 
of log(27re/12) bits per antenna w.r.t. the former. Moreover, 
Theorem [2] can also be used to obtain an upper bound on the 
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performance of linearly precoded IF equalization with uncoded 
transmission. 

Proposition 1: For all channel matrices H with correspond- 
ing WI mutual information C m = logdet(I + SNREUH), M 
transmit antennas and an arbitrary number of receive antennas, 
the error probability of linearly precoded IF equalization with 
uncoded transmission is bounded by 

d ^/i!\^2„ J 3 i t c R p _ IF _r((S mi„(c5J) ,ai)) \ 

-T^P-IF-uncoded < 4AZ OXp < ~ -2 M ^ wl 11 ^ "> 

provided that the precoding matrix P generates an 
M x M perfect linear dispersion space-time code C ST with 

S min (C^) = mino^xecsj | det(X)| 2 > 0. 

Proof: Using (ITOb . the error probability of uncoded IF 
equalization over the aggregate channel (l27t is bounded by 

P < A M 2 a™ / 3 9777 ( m2 log(SNR elr )-Miip.n.) \ 

-fe,P-IF-uncoded S ™ exp <—~ZM*\ I V , 

where we have used the fact that the transmission rate over 
the aggregate channel is M times larger than the actual 
communication rate i?p_ip. Now, replacing SNR e ff with its 
bound from Theorem [2] establishes the proposition. ■ 

V. Discussion and Summary 

The additive Gaussian noise MIMO channel in an open- 
loop scenario, where the receiver has complete channel state 
information whereas the transmitter has no channel state 
information was considered in this paper. It was shown that 
using linear precoding at the transmitter in conjunction with 
integer-forcing equalization at the receiver suffices to approach 
the closed-loop capacity of this channel to within a constant 
gap, regardless of the channel matrix H. To the best of our 
knowledge, this is the first practical scheme that guarantees 
only an additive loss w.r.t. capacity. Such a performance 
guarantee is much stronger than DMT optimality, which is 
at present the common benchmark for evaluating schemes. In 
particular, although our results are free from any statistical 
assumptions, they can be interpreted to obtain performance 
guarantees in a MIMO fading environment. Specifically, a 
scheme that achieves a constant gap from capacity is DMT 
optimal under any fading statistics, and achieves a constant 
gap from the outage capacity under any fading statistics. 

IF equalization uses coded streams, and is therefore usually 
less suitable for fast fading environments. Nevertheless, we 
have also developed new upper bounds on an uncoded version 
of IF equalization, which is more adequate for fast fading. 
We note that while uncoded IF equalization is quite similar to 
lattice reduction aided decoding, to the best of our knowledge, 
the performance of the latter was never studied at such a fine 
scale. 

Another appealing feature of the described scheme is that 
is is independent of the number of receive antennas, and the 
performance guarantees obtained in this paper do not depend 
on the number of receive antennas as well. Hence, the scheme 
is not sensitive to a degrees-of-freedom mismatch. 
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Appendix A 
Proof of LemmaQ] 

The probability density function of z e ff is symmetric around 
zero and hence 

Pr(z eff > t) = PrO eff < -r). 
Applying the Chernoff bound gives (for s > 0) 

Pr(z eff > r) < e~ ST E (e sz <") 

L K 

= e- ST Y[E(e so " z ') ]]E(e* 4 ) . 

£=1 k=l 

Using the well-known expressions for the moment generating 
function of Gaussian and uniform random variables gives 

E{e saez >) = e^ s2a ^, 
. M sinh^/2) < 
V ' s/3 k p/2 ~ 

where the last inequality follows from 
sinh(x)/a; < exp{cc 2 /6} (which can be obtained by simple 
Taylor expansion) [15|. It follows that 

= g-sr+Is 2 ^ (3()) 

Setting s = 7"/(7 2 ff gives the desired result. 
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